# Low-resource Language Optimization

Khmer Sentiment Xlm Roberta Base
MIT
Sentiment analysis model optimized for Khmer financial texts, capable of classifying positive/negative sentiments
Text Classification Transformers Other
K
songhieng
31
1
Asr Whisper Large V3 Salt
A speech recognition model adapted from whisper-large-v3, specifically optimized for multiple languages in Uganda
Speech Recognition Transformers Supports Multiple Languages
A
Sunbird
249
1
Jina Embeddings V3
Jina Embeddings V3 is a multilingual sentence embedding model supporting over 100 languages, specializing in sentence similarity and feature extraction tasks.
Text Embedding Transformers Supports Multiple Languages
J
jinaai
3.7M
911
Nllb 200 Ko Gec 3.3B
A multilingual text processing model supporting over 100 languages and writing systems, covering various Arabic dialects and minority languages
Large Language Model Transformers Supports Multiple Languages
N
sionic-ai
180
8
Aya 101
Apache-2.0
Aya 101 is a large-scale multilingual generative language model supporting instructions in 101 languages, outperforming similar models in various evaluations.
Large Language Model Transformers Supports Multiple Languages
A
CohereLabs
3,468
640
Mms 1b Fl102
MMS-1B-FL102 is part of Facebook's Massively Multilingual Speech project, an automatic speech recognition model supporting 102 languages, based on the 1-billion-parameter Wav2Vec2 architecture, achieving multilingual transcription through adapter technology.
Speech Recognition Transformers Supports Multiple Languages
M
facebook
6,360
26
Byt5 Small English
MIT
Historical multilingual and monolingual ByT5 base model, current version focuses on English text processing.
Large Language Model English
B
hmbyt5
30
1
Danskbert
Danish BERT is a language model optimized for Danish, excelling in the Danish ScandEval benchmark.
Large Language Model Transformers Other
D
vesteinn
151
6
Mbertu
A multilingual model for Maltese pre-trained on the Maltese corpus v4.0 based on multilingual BERT initial checkpoints
Large Language Model Transformers Other
M
MLRS
302
3
Bertu
BERTu is a monolingual Maltese model based on the BERT architecture, specifically designed for low-resource languages, supporting various natural language processing tasks.
Large Language Model Transformers Other
B
MLRS
4,486
6
Xglm 2.9B
MIT
XGLM-2.9B is a multilingual autoregressive language model with 2.9 billion parameters, trained on a diverse and balanced corpus of 500 billion subword tokens across multiple languages.
Large Language Model Transformers Supports Multiple Languages
X
facebook
229
9
Xlm Roberta Base Finetuned Ner Naija
A named entity recognition model fine-tuned based on xlm-roberta-base, specifically optimized for Nigerian Pidgin
Sequence Labeling Transformers Other
X
mbeukman
17
0
Xglm 1.7B
MIT
XGLM-1.7B is a multilingual autoregressive language model with 1.7 billion parameters, trained on a diverse and balanced corpus of 500 billion subword tokens.
Large Language Model Transformers Supports Multiple Languages
X
facebook
1,514
19
Wav2vec2 Large Xls R 300m Bulgarian
Apache-2.0
A Bulgarian speech recognition model fine-tuned on the MOZILLA-FOUNDATION/COMMON_VOICE_7_0 - BG dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition Transformers Other
W
infinitejoy
10.59k
2
Unispeech 1350 En 17h Ky Ft 1h
A speech recognition model based on Microsoft's UniSpeech architecture, specifically fine-tuned for the Kyrgyz language
Speech Recognition Transformers Other
U
microsoft
39
1
Wav2vec2 Large Xls R 300m Galician
Apache-2.0
This is an automatic speech recognition model fine-tuned on Galician speech datasets based on facebook/wav2vec2-xls-r-300m.
Speech Recognition Transformers Other
W
infinitejoy
31
0
Afriberta Base
AfriBERTa is a multilingual pretrained model supporting 11 African languages with 111 million parameters, suitable for tasks like text classification and named entity recognition.
Large Language Model Transformers
A
castorini
697
2
Clinicalnerpt Medical
Portuguese clinical named entity recognition model based on BioBERTpt, supporting 13 UMLS-compatible clinical entity types
Sequence Labeling Transformers Other
C
pucpr
55
6
Clinicalnerpt Finding
Portuguese clinical named entity recognition model based on BioBERTpt, supporting 13 UMLS-compatible clinical entity types
Sequence Labeling Transformers Other
C
pucpr
49
5
Roberta Tagalog Large
A Filipino RoBERTa model trained on the TLUnified corpus, improved from previous versions with case-sensitive processing support.
Large Language Model Transformers Other
R
jcblaise
534
3
Opus Mt En Mg
Apache-2.0
A Transformer-based machine translation model from English to Malagasy, developed by the Helsinki-NLP team.
Machine Translation Transformers Supports Multiple Languages
O
Helsinki-NLP
83
0
Opus Mt En Loz
Apache-2.0
Transformer-based English to Lozi machine translation model developed by the Helsinki-NLP team
Machine Translation Transformers Supports Multiple Languages
O
Helsinki-NLP
30
0
Opus Mt Mr En
Apache-2.0
Marathi to English machine translation model based on OPUS dataset, using transformer-align architecture
Machine Translation Transformers Supports Multiple Languages
O
Helsinki-NLP
1,256
0
Opus Mt En Lus
Apache-2.0
A Transformer-based English to Mizo (Lushai) machine translation model developed by the Helsinki-NLP team.
Machine Translation Transformers Supports Multiple Languages
O
Helsinki-NLP
48
0
Opus Mt En Cy
Apache-2.0
opus-mt-en-cy is a machine translation model based on the transformer-align architecture, specifically designed for translating English to Welsh. This model was developed by the Helsinki-NLP team and trained on the OPUS dataset.
Machine Translation Transformers Supports Multiple Languages
O
Helsinki-NLP
261
0
Opus Mt En St
Apache-2.0
opus-mt-en-st is a machine translation model based on the Transformer architecture, specifically designed for translating English to Southern Sotho.
Machine Translation Transformers Supports Multiple Languages
O
Helsinki-NLP
58
1
Opus Mt Fr Rw
Apache-2.0
OPUS-MT machine translation model from French to Kinyarwanda, based on Transformer architecture, trained using OPUS datasets.
Machine Translation Transformers Supports Multiple Languages
O
Helsinki-NLP
17
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase